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Shock  Waves,  Increase  of  Entropy  arid  Loss  of  Information 


By  Peter  D.   Lax" 


1.  Introduction 

We  present  an  informal  review  of  the  topics  in  the  title  as 
they  pertain  to  solutions  of  hyperbolic  systems  of  conservation  laws. 
These  are  systems  of  the  form 

(1.1>      u^   +  f^  =  0.  i  =  1,   ....  n; 

the  subscripts  t  and  x  denote  partial  derivatives.  Each  uMs  a 
density,  f'  the  corresponding  fiux.  Each  f"  is  a  function  of  all  u-''s,  so 
f'  can  be  expressed  as  a  linear  combination  of  u^.  In  matrix 
notation   (2.11  can  be  written  as 

(1.2;>      u,  ^  Au^  =  0 

wnere  u  denotes  the  column  vector  with  components  u^.  and  A  the 
matrix  whose  i"'^   row  is  the  u-pradient   of  f  : 

(:.3>       a  .  =  -^ 

The  matrix  A  is  a  function  of  u,  unless  f'  are  linear  functions  of  u:  m 
this  talk  we  deal  with  systems  that  are  genuinely  nonlinear,  in  a  sense 
to  be  made  precise 
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The    system    (1.2)    is    called    strictly    Kvperbolic    if    the    matrix    A 

has    real    and    distinct    eigenvalues    a^,    a2 a^    for    all    values    of 

u. 

We  are  interested  in  solving  the  initial  value  problem: 

(1.4)  u(x,0)  =  Uq(x). 

in  particular,  we  want  to  study  the  nature  of  the  dependence  of 
solutions  on  their  initial  data.  We  denote  by  S(t)  the  operator 
relating  solutions  at  time  t  to  their  initial  values: 

S(t):u(x,0)  -   u(x,t). 

The  main  facts  of  life  : 

(i)  The  initial  value  problem  has  no  proper,   i.e.,   differentiable 

solution  for  all   time  t,   no  matter  how  smooth  the  initial  data  are. 

(ii)  The  initial  value  problem  can  be  solved  for  all  time  if  we  admit 
solutions  of  (1.1)  in  the  integral  sense,  i.e.,  in  the  sense  of 
distributions. 

Solutions  in  the  distribution  sense  that  are  piecewise  continuous 
satisfy  the  Rankine-Hugoniot  relation 

(1.5)  sCu']    =    [f'],   i  =  1 n, 

where  [  ]  denotes  the  difference  between  the  value  on  the  left  side 
and  the  right  side  of  the  discontinuity;  s  is  the  velocity  with  which 
the  discontinuity  propagates. 

(iii)        Solutions  in  the  distributions  sense  are  not  determined  uniquely 
by  their  initial  data. 

In  these  notes  we  shall  describe  various  criteria  that  are  used 
to  accept  or  reject  distribution  solutions.  The  criteria  are  suggested 
by  physical  facts,  and  are  analyzed  mathematically.  The  analysis 
shows,   or  at  least   ieads  one  to  expec:. 


(iv)        The  various  criteria  all  picit  out  the  same  class  of  distribution 
solutions;  we  shall  call  these  relevant  solutions. 

(v)  Each  member  of  the  class  of  relevant  solutions  is  uniquely 

determined    by    its    initial    data.       The    initial    data    can    be    prescribed 
arbitrarily  within  the  class  of  L     functions. 

We  denote  by  S(t)  the  operator  linking  relevant  solutions  to 
their  initial  data. 

The  main  theme  of  this  talk  is  the  following  remarkable 
property  of  relevant  solutions: 

(vi)        The  set  of  relevant  solution  is  compact;  in  particular,  the 
operators  S(t),  t>0,  are  compact  in  appropriate  pairs  of  topologies. 

Note  that  (vi)  is  a  nonlinear  property;  for  linear  hyperbolic 
equations  S(t)  is  invertible,  in  most  cases  unitar>\  In  the  rest  of 
these  notes  we  shall  explore  how  noniinearity  brings  about 
compactness. 

Very  little  is  known  about  the  compactness  of  S(t)  in  more  than 
one  space  dimension. 

The  organization  of  these  notes  is  as  follows; 

In  Section  2  we  discuss,  for  the  simplified  model  of  a  single 
conservation  law.  the  concepts  of  genuine  noniinearity,  breakdown  of 
classical  solutions,  solutions  in  the  distribution  sense  and  their 
nonuniqueness,  the  viscosity  method,  finite  difference  methods,  and  the 
shock  condition. 

In  Section  3  we  discuss,  for  the  scalar  model,  the  compactness 
of  solutions  constructed  by  the  viscosity  and  difference  methods,  and 
derive  the  entropy  inequality  for  such  solutions. 

In  Section  4  we  derive  Giimm's  estimate  for  the  total  variation 
of  solutions  of  scalar  equations  that  satisfy  the  shock  condition,  we 
show  that  a  discontinuous  solution  that  satisfies  th'-  shock  condition 
also  satisfies  the  entropy  condition. 

In  Section  5  we  indicate  how  to  extend  the  notions  developed 
in   Sections   2,   3,   and  4  for  systems 


Section  6  contains  scattered  remarks  about  the  equations  of 
compressible  flow:  the  increase  of  entropy,  some  consequences  of 
Carnot's  theorem,  and  the  equipartition  of  energy  in  the  wake  of 
strong  shocks. 

Compactness  of  the  family  of  all  solutions  places  a  limit  on  the 
amount  of  information  contained  in  that  family.  It  is  natural  to  ask 
what  this  implies  about  the  amount  of  computing  needed  to  solve  an 
initial  value  problem  to  meet  a  specified  degree  of  resolution;  see  [14 J 
for  some  crude  notions. 

[13]  contains  a  bird's  eye  view  of  conservation  laws;  [16]  is 
a  thorough  treatise. 

2.  Single  Conservation  Laws 

We  study  (1.1)  for  n  =  1; 


(2.1)       "t  *  ^x  ^  °' 


f  some  function  of  u;  we  denote 

df 

(2.2)       =  a(u). 

du 

As  in  (1.2),   Equation  (2.1)  can  be  rewritten  as 

(2.3)  u^  -t-  a  Uj^  =  0; 

this  equation  asserts  that  u  is  constant  along  trajectories  x  =  x(t) 
satisfying 

dx 

(2.4)  —  =  a. 
dt 

In  view  of  this  interpretation,  a  is  called  the  signal  velocity:  the 
trajectories     defined     by     (2.4)     are     called     characteristics.  Since 

according  to  (2.3),  u  is  constant  along  characteristics,  and  since  a  is  a 
function  of  u,  it  follows  from  (2.4)  that  the  characteristics  are 
straight  lines. 


When    the    initial    value    Uq    of    u    is    specified,    we    can    construct 
through  each  point  (y,0)  on  the  initial  line  a  characteristic   line: 

(2.5)  X  =  y  +  a(uQ(y))t. 

Suppose  y^   and  y2  are  two  points,  y-i "^Yo-  ^^^  ^^^  inequality 

(2.6)  a(uj)>a(u2>, 
hoids,  where 

(2.6)'     u-j   =  UQ(y^),   Ug  =  Uo^^2*' 

(2.5)      shows      that      the      two      characteristics      issuing      from      (y^.O), 
respectively,   intersect   at   the  time 


y2   -   y 
T  = : 


a(u.)     -     aCu^) 


As  we  saw  before,  u  has  the  value  u-,  along  the  whole  characteristic 
issumg  from  (y-  .0).  and  the  value  u^  along  the  characteristics  issuing 
from  (y2.0).  So  at  the  point  of  intersection  u  has  to  be  equal  to  both 
u-  and  Uo:  since  (2.6)  shows  that  u.  x  Uo,  this  is  impossible  and 
shows   that   no  differentiabie  solution  u(x.t)   exists   beyond  the  time  T. 

Note   that   the   crucial   inequality   (2.6)   can   hold  only   if  a(u)  is   a 

genuine   function   of   u,    which   makes   f(u)   a   genuinely  nonlinear  function 

d 
of    u.       It   is    convenient    to    assume    that    —   a    *    0   for   all    u;    in   view 

du 
of  (2.2)  this  implies  that 


(2.7)      s:  0. 

du  *" 

i.e..   that  f  is  strictly   convex  or  concave. 


As  noted  in  the  introduction,  solutions  that  cannot  be  continued 
beyond  some  critical  time  T  can,  nevertheless,  be  contrived  as 
solutions  of  {2.11  in  the  sense  of  distributions: 

(2.8)       n       (u  w^  +  f  Wj^)  dxdt  +  Juq  Wgdx  =  0 

for  every  test  function  w  in  Cq.      This  is  consistent  with  the  point  of 
view  of  physics  that  conservation  laws  are  integral  relations: 


d 
(2.8)' 


—    f     udx  +      f      f-ndS  =  0 


d   G 


for    every    domain    G.       Relations    (2.8)    can    be    deduced    from    relations 

(2.8)  by  a  simple  process  of  approximation. 

As  remarked  in  Section  1.  a  piecewise  continuous  solution  is  a 
solution  in  the  distribution  sense  if  the  Ranicine-Hugoniot  relation  (1.5) 
is  satisfied  across  the  discontinuity; 

(2.9)  s[u]    =    [f], 

where    [    ]    denotes    the    difference    across   the    discontinuity,    and    s    is 
the  velocity  with  which  the  discontinuity  propagates. 

A  simple  example  of  such  a  discontinuous  solution  of  Equation 
(2.1),  with 


(2.10)    f(u)  =  u^/2 


r    1     for    X    <    t  /: 
2.11)    u-(x.t)  =  \ 

^  [    0     for     t /2     <     : 


The  discontinuity  is  across  the  line 


(2.12)    X  =  I  2 


which    propagates   with   velocity   s   =   1/2;    across   the  discontinuity    [u] 
=  1    [f]   =  1/2.  so  relation  (2.9)  is  satisfied. 
The  function 


rO     for     X     <     t/ 
(2.13)    u,(x,t)  =  i 

^  L    1      for     t/2     <     : 


also  is  discontinuous  across  the  line  (2.12),  and  [u]  =  -1. 
[f]  =  -1/2,  so  that  relation  (2.9)  is  satisfied.  Note,  however,  that 
the  function 


(2.14)    U3(x,t)  = 


0  for     X     ^     0 

x/t     for     0     $     X     $     t 

1  for     t     $     X 


satisfies  Eauation  (2.1)  with  f  given  by  (2.10),  so  that 


(2.15)     u,    +  uu     =  0, 


in  each  of  the  three  regions,  and  is  contipuous  ac-oss  the  boundaries 
X  =  0  and  x  =  t  separating  the  regions.  Since  U2(x,0)  =  Uo(x.O), 
solutions  in  the  distribution  sense  are  not  unigueb'  determined  by  their 
initial  data.  This  shows  that  an  additional  criterion  is  needed,  based 
on  physical  principles  and  buttressed  by  mathematicai  ones,  which 
rejects  certain  distribution  solutions.  The  remaining  acceptable  ones 
must  have  the  property  that  every  initial  value  problem  has  exactly 
one  acceptable  solution. 

There  are  several  ways  of  formulating  such  a  criterion  of 
rejection  or  acceptance;  happily,  they  turn  out  to  be  equivalent.  We 
list  the  most  important  ones: 

(a)  The  acceptable  solutions  u  of  (2  1)  are  the  limits  of  solutions 

u  of  a  family  of  equations  obtained  by  augmenting  the  flux  f  by  a 
small  viscous  term,  and  letting  the  viscosity  tend  Co  0: 


(2.16)    u  =   liB  u'^*. 
C-.0 


The    viscous    term    in    the    flux    is    -cu_.    €>0,    so    that    u'^     satisfies 
the  equation 

(2.17)    u^^>   *  f^^^    =   eu^^>.       f'^>  =  f(u<^>). 

and  has  the  same  initial  value  as  u: 


(2.18)    u'*^^(x,0)  =  Uq(x). 


(b)  The  acceptable  solutions  are  limits  of  solutions  v  of  a 

difference  approximation  as  At,  Ax  tend  to  zero.  Denoting  the  value 
of  v  at  X  =  kAx,  t  =  nAt  by  v[^,  the  difference  equation  is  of  the 
form 

here  we  use   the  abbreviation 

The   initial  values  are 


^0' 


The  rationale  for  using  limits  of  (2.19)  is  the  close  relation 
between  (2.191  and  (2.17):  this  can  be  seen  by  using  the  Taylor 
approximations 

v"^!^^    =  u  +  At  Uj.  +  -  (At»2  u^.,  +  O(A^) 


2 

1 
k-f  1      ■     "k-  1  I     ~    "     ■    "    '"■^'      "xx 


-    Iu.  ",    +  u    ■',  I    =  u  +  -  (Ax)2  u„„  +  O(A^) 


2     I  1  2 

1 


-,[h:.  -f.-i]  =  ^>".  * 


OiA-^). 


Setting    these     into     (2.19)     and     using     (2.1)     to     calculate    the    higher 
derivatives  of  u  we  get  that  v  approximates  solutions  of  (2.17),  with 


At 
(2.20)     €   =  — 
2 


Since   £  has  to  be  positive,  we  must  have 

Ax 

(2.21)    —  ^    lal; 

At 

this  is  tbe  celebrated  Courant-Friedricks-Lewy  convergence  criterion. 

We  remark  that  (2.19)  is  merely  one  of  a  variety  of  difference 
approximations     we     may     use.  Another     class     of     approximations, 

combining  viscosity  in  space  and  discretization  in  time,  is  Avron 
Douglis'  layering  method;  it  is  more  flexible  than  either  method  (a)  or 
(b),  see    [3]. 

An  entirely  different  criterion  for  accepting  or  rejecting 
distribution  solutions  can  be  based  on  the  analysis  of  the  mechanism 
that  causes  the  breakdown  of  smooth  solutions:  the  intersection  of 
characteristics.  Thus,  the  characteristics  issuing  from  the  initial  line 
for  the  initial  data  of  u^,  given  by  (2.11),  cross  in  a  wedge-shaped 
region 


Fig.   1 


The   role   of   the    discontinuity   in    (2.11)   along   x   =   t/2   is   to   keep   the 
characteristics   from   crossing.      This   is   in    contrast   to   the   behavior   of 


the    characteristics    issuing    from    the    inititil    line    for    the    data    of   u,, 
given  by  (2.13);  these  diverge  and  don't  cross  at  all: 


Fig.  2 


This  leads  to  the  concept  of  a  shock: 

A  discontinuity  of  a  piecewise  continuous  distribution  solution  is 
called  a  shock  if  the  characteristics  on  either  side  impinge  in  the 
forward  t  direction  on  the  discontinuity.  Denoting  by  Ur  and  Up  the 
value  of  u  on  the  left  and  right  sides  of  the  discontinuity,  and  by  a, 
and  ap   the  corresponding  signal  velocities: 

(2.22)    ar    =  a(ur ),   ap  =  a(up), 

we  can   express  this  condition  by  the  inequality 


(2.23)    aj^  >  s   > 


^R- 


where  s  is  the  velocity  of  propagation   of  the  discontinuity.     We  recall 
from  (2.9)  that 


(2.24)    s  = 


where 


10 


(2.25)    f^  =  flu^).  fR  =  f(uj^). 

(c)  A  distribution  solution  of  (2.1)  is  acceptable  iS  all  its 

discontinuities  are  shocks,  i.e.,   satisfy  condition  (2.23). 


3.  Viscosity  Methods 

As   a   start,    let's   assume   that   for   arbitrary   bounded   measurable 

initial    data    Uq    of    compact    support,    the    solutions    u      (x,t)    of    (2.17), 

(2.18)   exist   and   converge   in   the   L  (dxdt)   norm  over   (R  x  (0,T),   T  any 

value  >  0.     What  can  we  deduce  about  the  limit  u? 

2 
Multiplying   equation    (2.17)    by   any   Cq    tes 

integrating  by  parts  over  all  x  and  t  >  0;  we  get 


2 
Multiplying   equation    (2.17)    by   any   Cq    test   function   w(x,t)   and 


(3.1)  -{{    [u^^'w^  +  f(u*^)Wj^1    dxdt  -  J  UpWgdx  =   «   J  J  "  w^  dxdt. 

As    e    tends    to   zero,    u         tends   to   u   and   f(u      )   tends   to   f(u)   in   the 
L      norm.        Therefore,     (3.1)     tends     to     relation     (2.8),     characterizing 
distribution    solutions;    this    shows    that    the    L      limit    of   solutions    u 
of  (2.17)  is  a  solution  of  (2.1)  in  the  distribution  sense. 

Let  u  be  a  solution  of  (2.17);  since  the  initial  data  Uq(x)  has 
compact  support,  u  (x,t)  tends  to  0  rapidly  as  Ixl-^o*.  So  the 
function 

(3.2)  U"^^(x.t)  =    f        u*^*(y.t)dy 

is  well  defined  and  bounded.     Clearly, 

(3.2)'      U^^>    =  u<^>. 

Integrating  (2.17)  with  respect  to  x  gives 

(3.3)  U^^>.f[u^^)j    =   cU<^>, 


1  1 


provided  that  the  flux  is  normalized  so  that 

(3.4)  f(0)  =  0. 

Let  v'  be  another  solution  of  (2.17),  V^^'  its  x-integral, 
satisfying  an  analogue  of  (3.3).  Subtracting  the  two  equations  from 
each  other,  and  applying  the  mean  value  theorem,  we  deduce  that  the 
difference 

(3.5)  D  =  U*^*  -  V^*^> 
satisfies 

(3.6)  D,  .  a-D^  =   cD^, 
where 


a  =  

c  e 

U  -       V 


We    apply    now    the    maximum    principle    to    the    parabolic    equation    (3.6) 
and  deduce  that 


(3.7)       iDl  (  t  )   =  Max    iD(x,t)l 
max  X 


is  a  decreasing  function  of  t. 
Clearly, 


'^^^''<^>'max^   L    '""*"^t)idt. 


T  T 

(3.8)       f    lU*^*lCt)dt^    f    f      lu'^'(x,t)ldxdt 
Jo  max  •"oJfR 


12 


By    assumption,     u'^*     converges     to    u    in    the    L      norm;     it  follows, 

therefore,   from   (3.8)  that   U^^^  converges  to  U  in  the   L^(Max)  norm  on 

the    left    of    (3.8).       Similarly,    V*^'    converges    to    V;    by    the  triangle 
inequality,  for  each  t 

It        foUows        from        this        that         |U<^^-V^l(t)         tends        to 

max 

lU  -  Vl(t)    in   the   L-^(dt)   norm.      Since   by   (3.5)   and   (3.7)  the  former 


na  X 


is    a   decreasing   function   of   t,    and   since   the    L      limits   of   decreasing 
functions  are  decreasing,  it  follows  that 

(3.9)       lU  -  Vl  (c) 
max 

is  a  decreasing  function  of  t. 

The  quantity  •  U I  ^^^  is  called  the  W"^'°°  norm  u;  so 
property  (3.9)  can  be  expressed  in  terms  of  the  solution  operator  S(t) 
relating  initial  data  to  data  at  time  t: 

Theorem    3.1:        The    operators     S(t)    are    contractions    in    the    W"  ' 

norm. 

Since    equation    (2.17)    is    parabolic,    another    application    of    the 

maxuiium  principle  shows  that 

Max    iu'^*(x,t)l    =    lu'^'l  (t) 


max 


is  a  decreasing  function  of  t.  We  conclude,  as  before,  that 
lu^l(t)  tends  to  lul(t)  in  the  L^  norm,  and,  therefore,  that  the 
latter^also  is  a  decreas'lng"  function  of  time.  This  property  can  be 
expressed  so: 


1  3 


Theorem    3.2.        The    operators    S{t)    map    into    itself    any    ball    in    L 
centered  at  0. 

Theorems    3.1    and    3.2    derived    in    [8];    it    was    observed    there 
that  they  have  the  following  surprising  consequence: 

Theorem  3.3:  Suppose  equation  (2.1)  is  genuinely  nonlinear,  i.e.,  the 
function  f  is  strictly  convex  or  concave.  Then  the  operators  S{t),  t>0, 
map  any  bounded  subset  of  L  supported  on  a  given  interval  into  a 
compact  subset  of  L  . 

Proof:  Let  u  q  be  a  uniformly  bounded  sequence  of  functions 
supported  on  a  common  interval  of  the  x-axis.  Such  a  set  belongs  to 
a  compact  subset  of  W~  ,  i.e.,  a  subsequence  of  u  q  converges  to 
a  limit  u^  in  the  W~  '  norm.  This  limit  is  also  bounded  and  has 
compact  support.  According  to  Theorem  3.1.  for  each  t,u  (x,t) 
converges  in  the  W~  •'"  norm  to  u(x,t).  where  u  and  u  are  the 
solutions  of  (2.1)  constructed  by  the  viscosity  method  with  initial 
values  u  Q  and  Uru  respectively.  Both  u  and  u  satisfy  Equation 
(2.1)  in  the  distribution  sense  (2.8).     for  anv-  Cq   test  function  w. 


Wndx  =  0, 


(3.10)     JJ    fu*"'w^  ^  f(u'"*)w   1    dxdt  *  J  u^^^   Wq 


(3.10r    ff  (u  w^   T  f(u)w^)  dxdt  +  J    UQWgdx  =  0 

Since  u*'^  tends  to  u  in  the  W""'"  norm,  it  follows  that  the  first 
and  the  third  term  in  (3.10)  tends  to  the  first  and  third  term  in 
(3.10r      Therefore,  it  follows  that  so  does  the  second  term: 


(3.11)     j^    [f(u'"'»  -  f(u)j 


w^  dxdt  -,   0 


Next   we   make   use   of   the  fact   that  not  only  for  genuine  solutions   but 
also    for    distribution    solutions,     si^ais    propagate    with    finite    speed. 
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Since  u  q  =  Uq  =  0  outside  some  finite  x-interval,  it  follows  that 
u  "  and  u  are  zero  outside  a  bounded  act  in  x,t  space  for 
0  ^  t  $  T.  We  choose  w  so  that  w^  =  1  on  this  bounded  set;  since 
f(u")  =  f(u)  outside  this  set.  we  can  rewrite  (3.11)  as 


(3.11)'    jj    [f*"'"'>  -  f'">]    ^-^t  -   0- 


We  have  assumed  that  f  is  strictly,   convex,  say  f^fu)  ^   R  >  0.     Then 

(3,12)    f(u'"')  -  flu)  ^  f(u)(u*"*  -  u)  +  -  (u'"'  -  u)2 

2 

Integrating  this  we  get 

(3.12)'   jj    [f{u*°')  -  f(u)1    dxdt  ^  jj    [flu)  (u'°>  -  u)l    dxdt 

^l  II    fu'-'-uPdxdt 

Since  u  ''*  -  u  tends  to  zero  in  the  W~  '^  norm,  and  since  u  -  u 
is  un-formiy  bounded,  it  follows  readily,  by  approximating  f'(u)  in  the 
L"  ser.se  by  smooth  functions,  that  the  first  term  on  the  right  in 
(3.12"-  tends  to  zero.  Since  by  (3.11)'  the  left  side  tends  to  zero,  we 
deduct-  from  (3.12)'   that 


(3.13!     J    f  (u'"'   -  u)^   dxcr  -   0. 

Since  a'"    -  u  is  supported  on  a   bounded  set,  it  follows  that 

(3.13)     f   J    lu*°'  -  ul    dxdt  -  0. 

This  proves   L''(dxdt)  convergence   of  u  ^ 

To  prove  L'^(dx)  convergence,  we  appeal  to  the  following 
theorem  of  Barbara  Keyfitz    [7]: 

If  u  and  V  are  two  distribution  solutions  of  (2.1)  that  are  limits 
of  solutions  of  (2.17).   then 
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J    lv(x,t)  -  u(x,t)l    dx 


is   a   decreasing  function   of  time.      This  result   applies  in   particular  to 
V  =  u     ;  from  this  and  (3.13)'  we  conclude  that 


lu'"*(x.t)  -  u(x,t)l    dx  -   0 


for  each  t>0.     This  completes  the  proof  of  Theorem  3.3. 

It  was  already  observed  in  [8]  that  Theorems  3.1,  3.2  and  3.3 
hold  also  for  solutions  constructed  by  the  difference  scheme  (2.19);  the 
same  is  true  when  solutions  are  constructed  by  more  general 
difference  schemes,  as  long  as  they  are  of  monotone  type. 

A  basic  hypothesis  of  Theorems  3.1-3.3  is  that  the  solutions 
u  of  the  parabolic  equation  (2.17)  converge  to  a  limit  u  in  the  L 
norm  for  all  bounded  initial  data  Uq.  How  does  one  prove  such  a 
result?  In  the  case  (2.10)  of  a  quadratically  nonlinear  f,  E.  Hopf, 
used  the  fact  that  in  this  case  Equation  (3.3)  is  changed  by  the 
transformation 

U<^>  =  -2€log  V<^' 

into 

V^^^    =   £   V^^\  V(x.O)  =  exp   C\]^l^'2€j 

A  solution  of  the  heat  equation  can  be  expressed  as  an  integral  of  its 
initial  values.  Using  this  formula.  Hopf  was  able  to  show  that  as 
€  -t  0.  u  '  tends  to  a  limit  u;  he  even  obtained  a  fairly  explicit 
formula  for  this  limit.  It  was  remarked  in  [9]  that  a  version  of 
Theorem  3.3  can  be  derived  from  this  formula. 

The  convergence  of  solutions  of  the  difference  scheme  (2.19) 
can  be  proved  in  a  similar  fashion  for  the  special  choice  f(u)  =  log  (a 
+  be~"),    and    of    other    monotonic    schemes    for    other    special    choices. 
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Needless  to  say,   these  methods  are  very  special.     Recently,   see    [15], 

[17]    a   new   method,    capable   of   dealing   with   more   general   cases,   has 

been      introduced     by     L.   Tartar     and     F.  Murat;      we     give     a     brief 

description  of  their  ideas. 

The    first     idea,     going     back     to     L.  C.     Young,     is     a     precise 

(  €) 
description   of  weak    convergence.      Let    u  ( y )    be   a   uniformly   bounded 

sequence    of    functions;    then    it    has    a    subsequence    with    the    property 

that   for   every   continuous  function   g  the  weak     limits   in   the  sense  of 

L      exist: 

(3.14)  g{u^*^')  ^  Ug. 

s 

Clearly,  the  weak  limits  u  depend  linearly  and  positively  on  g.  It  is 
not  hard  to  show  that  u  can  be  represented  as  an  integral  of  g  with 
respect  to  a  family  of  probability  measures   v(y): 

(3.15)  Ugfyl  =  J  g(v)  dv(v,y|   =   <  g,v(y)  >. 

Secondly,  it  is  not  hard  to  show,  see  (3.12)',  that  (3.14)  is  convergence 
in  the  L"^  sense  if  the  measure  v(y)  is  concentrated  at  a  single  point 
for  almost  aii  values  of  y.     In  this  case 

<3.:6t    Ug  =  g(u) 

for  ever>-  g.  u   being  the  limit  of  u      . 

Tartar  takes  for  u  the  solutions  of  (2.17)  with  prescribed 
initial  values.  By  Theorem  3.2  this  sequence  is  uniformly  bounded; 
therefore,  it  has  a  subsequence  for  which  (3.14),  (3.15)  holds.  Tartar 
shews  that  in  this  case  v  is  concentrated  at  a  single  point  by  using 
the  notion  of 

CoBpensated  Compactness  (Tartar): 

Let  h  and  k  be  two  sequences  of  vector  functions  defined 
in  some  domain  of  y-space.   satisfying  the  following  conditions: 
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(i)  h        and  k        are  uniformly  bounded  in  L  ,   and  converge  weakly 

in  L^: 

(3.17)  h^^'  ^  h.  k<^>  -  k. 

(ii)         -div  h        and  curl  k        belong  to  compact  subsets  of  Hj^^. 

Conclusion:         The     scalar     product     of     h  and     k  tend     in     the 

distribution  sense  to 

(3.18)  h'^^-k'^^  --  h-k. 

Take   y  to   be   t,   x,   the   domain   to   be   (O.T)  x    [x.,,X2]    and  the 
vector  function  h        to  be 

(3.19)  h<^'  =  (u<^*,  f^). 

where  f        =  f(u^).      UsLig   Equation   (2.17)  we  see  thai 

(3.20)  div  h<^>  =  u^^>   -f^x^   =   ^   <x 

Clearly,    the    H,^^    norm    of    div    h         is    bounded    by    e    llu^ll-    where 

2 
II   II    denotes   L     norm.      This   cruantity   can   be   estimated   by  multiplying 

(2.17)   by  u        and   integrating   over   K  x   (0,T).      Since  uf     =  uf'(u)u     is 

a    perfect    x    derivative,    and    u         tends    to    zero    as     lxl-.»,    we    get, 

after  integrating  byparts  that 

(3.21)  -   f   lu'^^'l^dx    I      =  -€    f      lu^f^^dxdt 
2    J  'o  Jo 

It  follows   that 

1 


(3.21)'    e    llu^^^l^  =   e    ('  {    I  u  ^  ^  "^  I  ^dxdt   $   const  =  -  J  " 


ndx. 


ifi 


Clearly,    ellu    ^11    ^    c  const    tends    to    zero    as    c-»0;    this    shows 

that  div  h        belongs  to  a  compact  set   in  H7 

To    construct    k     '    we    take    any    C      function    7)(v),    and    define 


^(v)  by 


sc( 


v)  =    r        7)'{u)f'(u)Xu. 


Then 


(3.221     so'  =   7?'f 

Multiply  (2.17)   by    ;?';   using   (3.22)  we   can  write  the  resulting  equation 
as 

(3.23)     ,^l^   ^^^l^   =   e;?'u<^>   =   cn^^   -   c;,-u^^>' 

X 

We  take  now  the  vector  function  k        to  be 


(3.24)    k^^'  =    fso'^>,-/?'^>] 


Using  (3,23)  we  see  that 

(3.25)    curl  k'^'  =   :i^f^   *  '^^^   =   ^r^iV   -   c;?"u^^>' 


'  y.  X 


We   claim   that  the  right   side  lies  in   a   compact  set  in  H^^^.      Clearly, 
the  H^Q^   norm  of  the  first   term    ^''xx      ^®  bounded  by 


It   foLlcws,    as   before,   from   (3.21)'   that   this   tends   to  zero.      It  follows 

2 

from  (3.21)  that  the  second  term  in  (3.23),  er^'u^^^  ,  is  bounded 
Id  L  .  This  does  not  imply  H7'  _  compactness;  .to  get  that  we  note 
that     since     by    Theorem     3.2.     u  ^      is    uniformly     bounded    in    L    ,     it 
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follows      that      so      are      n  and      ^'^'.  It     follows      that      curl 

jj(€)  =;,<£>     +  v^^^     belong    to    a    bounded    set    in    W~^'~.        Now 
Tartar  appeals  to 


Murat's  Lemma: 

Suppose  the  set  of  functions   Cc3   satisfies  these  conditions: 

1     00 

(a)  Cc3  belongs  to  a  bounded  set  in  W~  '    . 

(b)  Each  c  can  be  decomposed  as 

c  =  cj  +  C2 

where  Cc-i3  belongs  to  a  compact  set  in  H^^^.  and  (^c^^  to  a 
bounded  set  in  L  . 

Assertion:     The  set   Cc}   belongs  to  a  compact  subset  of  H^^^. 

Having  verified   the   hypotheses  of  compensated   compactness,   we 
can  conclude  as  in  (3.18)  that 

(3.26)  u<*^V'*^*  -  f^*^*/?'^*  -  uv(u)  -  f(u);?(u) 

We  use  now  formula  (3.15)  describing  the  weak  limits  of  functions  of 
u  ;  taking  g(v)  =  Vfiiv)  -  f(v)7?(v)  we  see  that  the  weak  limit  of  the 
left  side  of  (3.26)  is 

<W   -  fT},V> 

The  various  terms  on  the  right  of  (3.26)  can  be  similarly  described; 
since  the  two  sides  of  (3.26)  are  equal,  we  conclude  that 

(3.27)  <vv  -  {n.v>  =  <\,v><^.v>  -  <f.v><7?,i'>, 
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where   v  =   v(x,t);  (3.27)  holds  for  a.a.(x,t). 
We  introduce  the  abbreviations 

(3.28)  u  =  <v,J'>,  f=<f(v),i'>; 
then  (3.27)  can  be  rewritten  as 

(3.29)  <{v-\i)v>,v>  =  <(f-f)7},v>. 

In  the  derivation  of  (3.27)  we  used  second  derivatives  of  /?, 
but  (3.27)  itself  depends  continuously  on  rj  in  the  C  topolog>-; 
therefore,  (3.37)  remains  true  for   n  piecewise  C'.     We  choose 

(3.30)  7?(v)  =    Iv-ul; 

then  from  (3.22) 

rf(v)     -     f(u)     for     u>v 
(3.30)'    ¥'(v)  =  \ 

[f(u)     -     f(v)     for     u<v 

Setting  these  choices  for   ti  and  V  into  (3.29)  gives 

<l  v-ul  (f-f(u)),v>   =  <(f-f)l  v-u,v>; 
we  deduce  from  this  that 

(3.31)  (f(u)-f)  <lv-ul,i'>  =  0. 

For  f  strictly  convex,  it  follows  from  (3.28)  and  Jensen's  inequality 
that  the  first  factor  (f(u)-f)  in  (3.31)  is  positive  unless  v  is 
concentrated  at  the  single  point  u;  the  same,  of  course,  is  true  for 
the  second  factor  <  I  v-u  !.>'>.  Thus,  it  follows  from  (3.31)  that  v  is 
concentrated   at   the   single   point    u,    and   so   u        tends   to   u   in   the   L 
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sense.  It  follows  then,  as  shown  at  the  beginning  of  this  section,  that 
u  =  lim  u        satisfies  (2.1)  in  the  sense  of  distributions. 

The  argument  outlined  above  shows  that  for  every  sequence  of 
€-.0  we  can  select  a  subsequence  such  that  the  solutions  u  '  of 
(2.17)  with  prescribed  initial  value  Uq  tend  in  the  L  sense  to  a 
distribution  solution  u  of  (2.1)  with  initial  value  Uq.  To  prove  that 
lim   n        exist,   we  have  to  show  that   any  two  subsequences  have  the 

€-►0 

same  limjt.  For  this  we  need  the  following  characterization  of  such 
limits,  see    [11]: 

Theorem  3.4:  Let  u  be  the  L  limit  of  a  subsequence  u  '  of 
solutions  of  (2.27).  Let  /?  be  any  convex  function,  and  V  related  to  tj 
by  (3.22).     Then 


(3.32)     ;?(u)^  +  SO(u)jj  $   0 

in  the  sense  of  distribution. 

The     proof     follows     from      (3.23);      for     when      ;?      is     convex. 
n"  ^  0,  and  so  (3.23)  implies 


/5^  -  ^^V  ^  €, 


(€) 
XX 


(3.32)  is  the  limit  in  the  distribution  sense  of  this  relation  as   cO. 

Condition  (3.32)  is  called  an  entropy  condition;  this  notion  will 
be  elaborated  in  Sections   5  and  6. 

It  can  be  shown  that  distribution  solutions  of  (2.1)  that  satisfy 
the  entropy  conditions  (3.32)  are  uniquely  determined  by  their  initial 
data.  This  proves  that  the  L'  limits  of  two  different  subsequences  of 
u        are  the  same;  this  completes  the  proof  of  convergence. 

Tartar      observed      that      the      argument      above      for      the      L 
convergence  of  u        also  proves  the  compact  dependence  of  solutions  u 
on    their   initial   data.      For    let   u^'^      be   a    uniformly   bounded   sequence 

s 

of    data    with    common    support,    converging    in    the    weak      topology    of 
L°°     to     Uq;     denote     by     u  the     corresponding     solutions     of     (2.1) 
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constructed  by  the  viscosity  method.  A  subsequence  can  be  selected 
so  that  for  every  continuous  g,  g(u  )  converges  in  the  weak  topology 
of  L    .     Again,  we  look  at   the  two  vector  valued  functions 

By  (2.1), 

grad  h'"'  =  0. 

and  by  taking  the  limit  of  (3.5) 

curl  k'"* 

are  measures,  uniformly  bounded.  Using  compensated  compactness  we 
conclude  as  before  that  u  converges  in  the  L  norm  to  a  limit  u,  a 
solution  of  (2,1)  with  initial  value  Uq.  Since  each  u"  satisfies  the 
entropy  inequality  (3.32).  taking  the  distribution  limit  of  (3.32)  shows 
that  so  does  the  limit  u.  Then  by  the  uniqueness  theorem  quoted 
above  we  conclude  that  u  is  the  solution  of  (2.1)  with  initial  value  Uq 
obtained  by  the  viscosity  method.  Thus,  the  L  limits  of  two  different 
subsequences  of  u  are  the  same:  this  completes  the  proof  of  the 
compactness  of  the  mapping  Uq-.u  from  L      to  L  . 

We  close  this  section  by  remarking  that  compactness  is  a 
property  one  usually  associates  with  the  dependence  of  solutions  of 
parabolic  equations  such  as  (2.17)  on  their  initicil  data;  unlike  in  the 
linear  case,  this  property  is  preserved  as  €  tends  to  zero.  It  is 
instructive  to  see  in  some  detail  how  this  happens.  From  (3.21)  we 
get  that 


d       /.    I  u 


(€)  .  2 


el, 2. 


(3.33)    —    f  dx  =  -e    f    (u'"'l'-dx. 

Gt    J  2  J 
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i.e.,  that  e  I  u  dx  is  the  rate  of  energy  dissipation.  We  saw  earlier 
that  the  limit  u  of  u  has  discontinuities;  it  can  be  shown  that,  for  c 
small,  u  bridges  this  discontinuity  in  a  layer  £{€)  of  width  c;  in 
this  layer  it  has  the  shape  of  a  traveling  wave: 


(3.34)    u**^*(x,t)   ^  w 


,t    . 


where  w(f)  is  a  solution  of 

-sw  +  f'w  =  w, 


d 


Differentiating  (3.34)  we  see  that 
1 


e 
i(e) 


J,u<J>,2dx.-J-(:-nj2,i,  =  Jo,c,2.r 


This   shows   that   the   rate   of   energy'   dissipation   does   not   tend   to   zero 
as   e   tends  to  zero.     This  is  in  sharp  contrast  to  the  linear  case: 


.<5> -"<;>. 


when    u         tends    to    a    discontinuous    limit.       Here    the   transition    layer 

1/2 
has  width    e        ,   and   the  shape  of  the  wave  is 


u^^*(x,t)   ::   w 


1  /2 


So  the  rate  of  energy  dissipation 


I-'"'''Il47I^  =  '"M-« 


tends  to  zero  as  c  tends  to  zero. 
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4.  Consequences  of  the  Shock  Condition 

In  this  section  we  study  those  distribution  solutions  of  (2.1)  all 
of  whose  discontinuities  are  shocks,  i.e.,  satisfy  condition  (2.23).  We 
present  a  result  of  James  Glimm  that  shows  that  such  solutions  form  a 
compact  set. 

The  shock  condition  relates  the  signal  velocities  a,  and  ap  to 
the  left  and  right  of  the  discontinuity  with  the  velocity  s  with  which 
the  discontinuity  propagates: 


(4.1)      a,    >  s  >  a 


R 


(4.1)  expresses  the  fact  that  characteristics,  i  e.,  curves  propagating 
with  signal  speed  drawn  in  the  forward  direction,  intersect  the  shock. 
It  follows  from  this  that  characteristics  drawn  in  the  backward 
direction  cannot  intersect  an>'  shock.  It  follows  from  this  that  in  a 
solution  ail  whose  discontinuities  are  shocks,  every  point  (x,t)  can  be 
connected  by  a  characteristic  to  a  point  on  the  initial  line. 

For  simplicity  we  shall  study  solutions  u(x,t)  whose  initial 
data — and,  therefore,  themselves — are  periodic  functions  of  x  with 
period  L: 

u(x  +  L.t)  =  u(x,t). 

We  shall  estimate  the  total  variations  of  u(x,t)  with  respect  to  x  per 
period.  We  start  by  estimating  the  total  variation  of  a  =  a(u(x,t));  the 
total  variation  is  the  sum  of  the  increasing  variation  A  and  the 
decreasing  variation  A~;  for  a  period  function  the  two  are  the  same: 

(4.2)  Total  variation  of  a  per  period  at  time  t  =  2A  (t) 

To    estimate    the    increasing   variation    of   a    we   note    that    according  to 

the    shock    condition    (4.1),    the    discontinuities    of   a    contribute    only  to 

the  decreasing  variation  of  a.  Therefore,  we  can  calculate  A  (t)  by 
dividing  an  intervai   of  length  L  at  t   into  subintervals  by  points 
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IqM     <     Xj{t)    <      •      <     Xfr(t)    =    Xglt)    +     L 


so   that   a   is   aiternateiy   increasing   and   decreasing   along   each   inter\'al. 
Then 

(4.3)       A"(t)  =  L  32^^-    -  a2„. 

where 


(4.41      aj  =  a(u(Xj,t)). 

We    connect    now    the    points    Xj,t    to    points    y.    on    the    initial    line    by 
characteristics: 


A 


y.      -'^  y-L-         ^r       ^ 


H h- 


'^h    '^  1^''>' 


By   (2.5) 

(4. .5)       Xjft)  =  y.  +  a;t 

Since  x>.  =  Xr,  +  L. 


Z  \X':\    -  Xo„(t)   $   L 


Using  formuia  (4.5)  for  x,  we  gei 

-  2n  +  l        -  2r  2n+l  2n 
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Since  the  first  sum  is  positive,  we  conclude,  using  formula  (4.3),  that 

A  (t)  $  -. 

t 

So  by  (4.2)  we  conclude  that 

2L 

(4.6)  Total  variation  of  a(u(x,t)l  per  period  ^  — 

t 

We  use  now  the  fact  that  (2.1)  is  genuinely  nonlinear,  i.e.,  that  a  is 
nonconstant  function  of  u.  Denote  R  a  lower  bound  for  I  a'(u)  I  ; 
then  Total  Var  a(u)  5   R  Total  Var  u,  so  we  conclude  from  (4.6): 

Theorem  4.1  (Glimm):  Suppose  equation  (2.1)  is  genuinely  nonlinear  in 
the  sense  that 

(4.7)  ir(u)l    ?   R. 

Let  u  be  a  distribution  solution  of  (2.1)  all  whose  discontinuities  are 
shocks,  and  which  is  periodic  in  x  with  period  L.  Then  the  total 
varia:ion  of  u(x,t)  in  x  per  period  is 

2L 

(4.8)  $  . 

Rt 

It  follows  from  (4.8)  that  for  any  t>0,  the  measures  u  (x,t)  are 
uniformly  bounded  and,  therefore,  weakly  compact;  it  is  remarkable 
thai  nothing  need   be  assumed  about  the  initial  data. 

We  can  obtain  a  compactness  result  about  the  u  themselves  if 
we  can  get  bound  for  their  integral.  This  is  easy  because  of  the 
conservation  form  of  the  equation:  if  we  integrate  (2.1)  with  respect 
to  X.  we  deduce  ^hat 


(4.9) 

0 


u(x.t)cx 
is  independent   of  t. 
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There  is  an  analogue  of  (4.8)  for  solutions  that  are  not  periodic 
in  X  but  of  compact  support.  The  total  vjiriation  of  u(x,t)  in  x  is  less 
than 


(4.8)' 


1/2 


where  the  constant  depends  on  the  length  of  the  support  of  the  initial 
data  but  on  nothing  else,  see    [12]. 

We  conclude  this  section  by  showing  that  solutions  which 
satisfy  the  shock  condition  (4.1)  satisfy  the  entropy  condition  (3.32)  as 
well,  and  conversely.  Suppose  that  f  is  convex,  so  that  a  is  an 
increasing  function  of  u;  then  (4.1)  is  equivalent  with 

(4.10)  u^  >  Uf^. 

For    a    piecewise    continuous    solution,    the    left    side    of    (3.32)    is    zero 
except  along  a  discontinuity  x  =  y(t),   where  it  has  the  value 

(4.11)  in^^  -   r}^)s  -  (S^L  -   *^r' 

times    5fx-y{t)):  here  s  denotes  the  shock  velocity 

dy 
dt 

Since  by  (4.10),  Ud   <  u,  ,  we  can  write  (4.11)  as 
(4.11)'  (;?  s  -   '^•)dw. 

Using  relation  (3.22)  this  can  be  written   as 

(4.11)"    f    ^   ;?(s  -  f')du 
J  II 
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We  recall  now  the  Rankine-Hugoniot  condition  (2.9): 

this  can  be  expressed  by  saying  that 

f(u)  -  su 

has  the  same  value  at  u^  and  Uj^;  denoting  this  common  value  by  g 
can  integrate  (4.11)"  by  parts  to  obtain 


n    (f 


N.ll)"'  "-  ;?"  (f  -  su  -  g)dn 


Since  n  is  assumed  convex,  /?"  >  0;  since  f  is  assumed  convex,  it  lies 
below  any  secant,   i.e. 

f   <   su   T   g 

for  u  between  Uj^  and  u^.  This  shows  that  (4.11)'"  is  <  0;  thus,  so  is 
(4.11),  and  the  entropy  condition  (3.32)  is  fulfilled.  The  proof  of  the 
converse  is  the  same. 


5.  Systems  of  Conservation   Laws 

In  this  section  we  note  very  briefb'  the  extension  of  the  ideas 
in  Sections  2,  3  and  4  to  systems  discussed  in  Section  1.  First  of 
all,  we  have  to  assume  that  the  matrix  A  given  by  (1.3)  and  appearing 
in  Equation  (1.2), 


(5.1) 


u^  *  Au^  =  0, 


has  real  and  distinct  eigenvalues  a,<a2<...<a  ;  this  makes 
equation  (5.1)  h>T5erboIic  The  eigenvalues  a  are  functions  of  u,  and 
so  are  the  eigenvectors  r: 
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(5.2)      Ar  =   ar. 

Genuine  noniinearity,  see    [8],  is  defined  to  be 


(5  3)       a^    •    fj  *  0 


for  j  =  1,  2 n:  here  '  denotes  the  gradient  with  respect  to  u. 

The  curves  that  satisfy 


dx 

dt 


(5.4)      —  =  a- 


are  still   called   characteristics,   but  they  are  no  longer  straight  lines  in 
general. 

A    function    w    of    u    is    caled    a    Riemann    invariant    for    the    j 


,-th 


field  if  i  for  any  solution  u  of  (5.1),  w(u)  is  constant  along 
characteristics  of  the  j  field.  It  follows  easily  that  this  is  the  case 
if  w',   the  gradient   of  w,   is  a  left   eigenvector  of  A; 


(5.5)      w'A  =   a-w' 


For  then  multiplying  (5.1)   by  w'  gives 


(5.5)'     w^^  +   QjW^   =  0. 


Since   eigenvectors   can    be   rescaled,    system    (5.5)   can   always   be   solved 
when  n  =   2:  for  n>2  solutions  exist   only  in   exceptional  cases. 

As  in  the  scalar  case,  solutions  in  the  classical  sense  cannot  be 
continued  beyond  a  finite  time,  see  [103  and  [6],  so  again  we  have 
to  turn  to  distribution  solutions.  As  before,  a  piecewise  continuous 
solution  is  a  distribution  solution  if  the  Rankine-Hugoniot  condition 
(2.9)  is  satisfied  for  all  n  components.  Again,  the  class  of  distribution 
solutions  is  too  broad,  and  has  to  be  narrowed  by  imposing  some 
criterion  of  acceptance  and  rejection.  The  ones  that  make  physical 
and  mathematical  sense  are  the  same  ones  that_  were  employed  for 
scalar  equations: 


30 


(a)  The  acceptable  solutions  are  limits  of  solutions  of  a  family  of 

equations  obtained  by  augmenting  the  flux  by  a  small  viscous  term, 
and  letting  the  viscosity  tend  to  zero.  The  simplest  way  of 
augmenting  the  flux  leads  to  the  parabolic  systems 

<5  6)      u,  .  f^  =   cu^, 

or  more  generally 


(5.6)'      u^  .  f^  =   cDu^^, 


D  some  sjinmetric,  positive  matrix.  We  remark  that  physical  viscosity 
and  heat  conductivity  lead  to  a  matrix  D  that  is  merely  nonnegative, 
and  is  a  function  of  u.  For  this  reason  equation  (5.6)  is  said  to 
contain  artificial  viscosity. 

In  [11],  an  entropy  condition  is  formulated  for  limits  of 
solutions  of  (5.6).  Two  functions  rf(u)  and  V(u)  are  needed,  with 
these  properties: 

(il  7>  and  V  satisfy  the  differential  equation 

(5.7)       ;;  A  =  <fi\ 

where   r;'  and  V'   denoted  gradients  with  respect  to  u. 

(ii)  7?(u)  is  strictly  convex 

It      follows      from      (i)      that      if      u'^'      satisfies      (5.6),      ;?'^* 
=   ;?(u*')    and     S"*^*     =  "^(u'^*)    satisfy    the    vector    analogue    of    (3.23). 
From    this    relation    and    (ii)    we    deduce    that    the    L      limit    u    of    u 
satisfies  the  entropy  inequality  (3.32). 

For  n  =  2,  equation  (5.7)  has  many  convex  solutions.  For 
n  >  2,  (5.7)  is  an  overdetermined  system  which  only  exceptionally  has 
a  solution:  these  exceptional  cases  happib'  include  most  of  physical 
interest,  see  Section  6. 
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Theorem  3.1  has  no  analogy  for  systems;  nor  does  E.  Hopf's 
trick  of  linearizing  (5.6)  work  for  n  >  1.  Happily  Tartar's  approach  is 
not  tied  to  n  =  1,  and  in  fact  Ron  Di  Perna  succeeded  in  extending  it 
to  significant   cases  for  n  =  2,   see    [1]    and    [2]. 

(b)  The  finite  difference  approach  analogous  to  (2.19)  can  be  set 
up,  as  well  as  other  more  general  ones.  These  work  very  well  in 
practice — some  better  than  others — but  it  is  very  hard  to  prove 
anything  rigorously  about  their  convergence.  In  [5]  Glimm  succeeded 
in  proving  the  a.e.  convergence  of  a  scheme  that  is  the  mixture  of  a 
discrete  scheme  due  to  Godunov  and  a  Monte  Carlo-type  scheme. 

(c)  The  notion  of  a  shock  can  be  extended  to  piecewise  continuous 
solutions  of  hyperbolic  systems  of  conservation  laws,  as  follows.  In 
place  of  condition  (4.1)  we  require  that  there  be  an  index  k  such  that 


^k  <"l'  "  '  '   °k-l  <"L*' 
(5.8) 

°k-l  <"r'  '  ^  ""   °k  *"R> 

Thus,  there  are  n  distinct  families  of  shocks,  corresponding  to  k  =  1, 
....  n. 

In  a  k-shock,  n  -  k  -f  1  characteristics  impinge  from  the  left 
on  the  line  of  discontinuity,  and  k  from  the  right,  altogether  n  +  1. 
The  information  carried  by  these  characteristics  combined  with  the 
Rankine-Hugoniot  relations  (2.9)  serve  to  determine  uniquely  the 
solution  on  either  side  of  the  curve  of  discontinuity,  and  determine  the 
curve  itself. 

Glimm's  meihod  for  estimating  the  total  variation  of  solutions 
sadisfving  the  shock  condition,  described  in  Section  4.  has  been 
extended  in  [5]  to  solutions  with  small  oscillation  of  pairs  of 
conservation  laws 

It  was  shown  in  [11]  that  a  piecewise  continuous  solution  that 
satisfies  the  shock  condition  also  satisfies  the  entropy  condition,  at 
least  for  sufficiently  weak  shocks. 
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6.  ThermodN'n amies  and  Gas  Dynamics 

There  are  many  thermodynamicai  variables;  some  of  them,  like 

Density,   p 
Temperature,  T 
Pressure,  p 
Soundspeed,   c 

are  quantities  palpable  in  everyday  experience.     Others,  such  as 

Internal  Energy,   e 
Entropy,  S 
Enthalpy,  i 

per  xmit  mass,  are  theoretical  constructs.  In  a  single-phase  system  at 
thermodv-namic  equilibrium  any  three  are  functionally  related,  i.e.,  are 
linked  by  an  equation  called  equation  of  state.  Any  of  the  variables 
can   be  expressed  as  function  of  any  other  two. 

The  equations  governing  the  flow  of  compressible  gas  are  the 
laws  of  conservation  of  mass  momentum  and  energy.  Let  p,  m  and  E 
denote  mass,  momentum  and  total  energ>'  per  unit  volume,  u  the 
velocity  of  the  fiow,   set 


2 
(6.1)      m  =  /3u,   E  =   pe  +  —   4>u  . 


The   conservation    laws   for   flows   depending   on   a    single   space   variable 
X  are; 

^  -  m^  =  0 
(6.21      m^  +  (um  +  P)^  =  0 
E^  +  (u(E  +  p))j^  =  0. 

It   is   not   harti  to  show  that  this  system   is  hyperijolic;   the  three   signal 
veiociues  are 
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j-Sp-,1/2 

(6.3)  a,   =  u  -  c,   a-,  =  u,   a~  =  u  +  c,  c  =     — 

1  2  3  ^g^j 

a-,  and  a.^,  are  genuinely  nonlinear  in  the  sense  of  (5.3).  The  signal 
velocity  a^  =  u  is  linearly  degenerate;  the  corresponding  characteristic 
curves  are  particle  paths. 

It    follows    that    if    equations    (6.2)    are    satisfied    at    every    point, 
then 

(6.4)  S,.  +  uSjj  =  0. 

In  words:  in  a  smooth  flow  entropy  per  unit  mass  is  constant  along 
particle  paths. 

Combiriing   equation   (6.4)  with   (6.2)^  we  deduce  the  conservation 
law 


(6.5)      (pS),    +  (upS)j^  =  0 


for  entropy  per  unit  volume.  It  is  not  hard  to  show  that  pS  is  a 
concave  function  of  p.  m  and  E:  since  (6.5)  is  a  consequence  of  (6.1), 
the  pair 

T}  =  pS  and  V  -  upS 

must  satisfy  relation  (5.7).  Thus,  it  follows  that  distribution  solutions 
of  (6.1)  which  are  the  limits  of  artificially  viscous  flows  (5.6)  satisfy 
the  encropy  inequality  (3  32): 

(6.6)      (pS)^  ^  (upS)j^  ?   0. 

We  have  seen   in   equation  (4.11)  that  for  a  piecewise  continuous 
solution  relation  (6.6)  means  that  across  each  discontinuity 


[{pS)^  -  (/jS)j^]    s  -  (upS)^  +  (upS)p^  5 


0 
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Rearranging  turns  this  into 

(6.7)  (s  -  u^)p^S^^  -  (s  -  Uf^l/'^Sf^  ^   0. 

According  tc  the  Rankine-Hugoniot  relation  applied  to  the  conservation 
of  mass  equation  (6.1)-., 

Sifi^    -    Pj^    =    (u/>)j^    -    (U/3>j^, 

which  can  be  rearranged  as  below: 

(6.8)  (s  -  ^i)Pi^  =  (s  -  Uj^)/3j:^- 

We    turn    now    to    the    shock    relations    (5.8);    for    k    =    1    we    get 
from  (5.8),  using  (6.3),   that 

(6.9)-,     Ut    >  s,  Ud   >  s 

while  for  k  =  3  we  get   that 

(6.9)2    "L  ^  ^"  "R  ^  ^ 

Thus  in  case  k  =  1,  (6.8)  is  negative,  in  case  k  =  3,  (6.8)  is  positive. 
Setting  this  into  (6.7)  we  conclude  that  for  k  =   1 

(6  10).  S^  <  Sf^, 

while  for  k  =  3, 

(6  10)3  S^  >  Sp. 

It  follows  from  (6.9).  that  in  case  k  =  1,  particles  cross  the  shock 
from  left  to  right,  while  (6.9)o  says  that  in  case  k  =  3  particles  cross 
the  shock  from  right  to  left.  So  both  (6.10)^  and  (e.lO)^  can  be 
summarized  in  the  single  statement 
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Theorem  6.1:     When  gas  crosses  a  shock,  its  entropy  increases. 

We  snail  now  derive  an  integral  form  of  the  increase  of 
entropy.  We  assume  that  the  gas  is  in  the  same  state  and  that  it 
travels  at  the  same  velocity  at  x  =  =«  as  at  x  =  -».  We  normalize 
entropy  so  that  its  value  =  0  at  x  =  *«■.  Then  integrating  (6.6) 
with  respect  to  x  gives 


(6.11)    —       pS  dx  ^   0; 
dt    J 


in  words. 

Theorem  6.2:  The  total  amount  of  entropy  in  the  flow  field  is  an 
increasing  function  of  time. 

We  compare  now  this  result  with  Glimm's  result  in  Section  4, 
according  to  which  the  total  variation  of  a  solution  tends  to  decrease 
with  time.  This  decrease  of  total  variation  can  be  thought  of  as  a 
loss  of  detai"..  or  loss  of  iri'ormation.  Ever  since  Maxwell's  demon, 
BoIt2manR's  H  theorem,  Smoluchowsky's  study  of  fluctuations,  Szilard's 
thesis  and  Shannon's  work  on  information  and  its  transmission  there 
have  been  many  attempts  to  link  increase  of  entropy  to  decrease  of 
information  about  the  gas.  We  have  shown  here  that  shock  waves  do 
both:  increase  entropy  and  decrease  information,  although  not  on  the 
molecular  level  but  pertaining  to  organized  motion  in  the  whole 
riow-fieid. 

We  turn  now  to  another  manifestation  of  the  second  law  of 
thermodynamics. 

Carnoi's  Theorem:  Consider  an  engine  that  extracts  heat  energy  in 
the  amount  Q-  from  a  hot  reservoir  whose  temperature  is  T- ,  and 
dumps  Q  amount  of  heat  energy  into  a  cold  reservoir  whose 
temperature  is  T  :  the  difference  is  turned  into  mechanical  energy 

W  =  Q.  -  Q.. 
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The  engine  operates   cyclically,   i.e.,   everything  is  in  the  same  state  at 
the  end  of  the  cycle  as  at  the  beginning. 

Assertion:     The  efficiency  of  such  an  engine  in  converting  heat   energy 
into  mechanical  energy  cannot  exceed 

T.  -T 
h        c 


Proof:  During  the  absorption  of  heat  energy  in  the  amount  of  Q, 
from  a  reservoir  at  temperature  T  ,  the  engine's  entropy  is  increased 
by  the  amount 


(6.12)     S  =  ~ 
^  h 


Since   at   the   end  of  the   cycle   the   engine   returns   to  its   original   state, 

this    amount    of    entropy    must    be    returned    to    the    outside    when    Q 

c 

amount     of     heat     is     released     to     a     reservoir     at     temperature     T   . 
Assuming  that  no  further  entropy  is  generated  during  the  cycle, 

(6.12)'   S  =  —. 
c 

The  amount   of  mechanica;   energy  W  extracted  is  the  difference  of  the 
heat   energy  absorbed  and  released: 


W  =  Q.    -  Q^. 


Using  (6.12)  and   (6.12)'  we  get   the  following  formula  for  the  efficiency 
W/Q^: 

0,-0  ST,   -ST  T,   -T 

W  he  h  c  he 

(6.13)     =  =  =  

Ov.  0,  ST.„  T, 


This  proves  the  theorem.  Note  that  if  there  is  entropy  production 
duririg  the  cycle,  e.g.,  by  shocks  produced  by  the  operation  of  the 
engine,  then  an  additional  amoant  S'  of  entropy  is  produced  and  has  to 
be  gotten  rid  of.     In  this  case,   relation  (6.12)'   has  to  be  modified  to 

(6.12)"  S  +  S'  =  7~. 
c 

Clearly,  this  lowers  the  efficiency  to 

Ciearb'.  for  sake  of  efficiency  one  must  keep  the  entropy  production 
and.  thus,  the  strength  of  shock  waves  formed  during  the  operation  of 
the   engine  to  a  minimum. 

What  does  Carnofs  theorem  impiy  for  gas  d>-namics?  Imagine 
an  infinite  tube  with  unit  cross  section,  filled  from  a  cold  reservoir, 
except  for  a  finite  section  (-L,L)  which  is  filled  from  a  hot  reservoir. 
The  initial   data  are 


Uq(x)   =   0 


:l 
(6.241     ej-|(x) 


Te        for      1  X  I  <1 
L  e        for      I  X  I  > 1 

fT,       for      |x|<L 
To{x>  =         ^ 
"  [t        for      I  X  I  >L 

The  added  heat  energy  is 

2L/.(ej^  -   ^c' 

According   to  Carnot's   theorem,   the   amount   that   may   be  converted  into 
mechanica;   energ>'  does  not   exceed 
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Therefore,     kinetic     energy,     which    is    mechanical    energy,    can    at    no 
future  time  exceed  this  amount. 

This  argument  can  be  extended  to  variable  initial  data: 


^0 
(6 


Unix)  =  0  for    I  x  I  >L 


r^  e  for  i  X  I  < 

14)'   ef.{x)  I  "" 

"  L=  e^  for  I  XI  > 

r^  T^  for  I  X  I  <1 
TqIx)    \ 

"  L=  T^  for  |>L. 


We  can  think  of  constructing  such  data  by  filling  a  finite  section  of 
the  tube  with  gas  taken  from  a  collection  of  reservoirs  and  setting  it 
in  motion.  The  amount  of  heat  energy  that  can  be  converted  to 
mechanicai   energ>"  is 


(6.15)         CrAx}er,{x)      -r    /  ^  \         dx 


J    '^O'^^^^O 


To(x) 


Kinetic  energ>'  at  any  time  t>0  cannot  exceed  this  amount  plus  the 
kinetic   energ>'   initiall:.'  imparted  to  the  sj'stem: 

ThecreiE  6.3:  Let  u.  p.  e.  T  denote  the  velocity,  density,  internal 
energ>'  and  temperature  of  a  gas  moving  in  a  tube,  whose  initial  data 
satisfy  the  restrictions  (6.14)'.     Then  for  ail  t>0 


^    c       7  '    r         2  r  —2 - 

lb)    -  J    pu-dx  $  _  J    pguodx  *  J    p^e^    ^^      d 


This    inequality    holds    in    any    number    of    dimensions;    it    seems 
desirable  to  obtair:  a   proof  of  it  by   PDE  methods. 
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We  conclude  by  recounting  a  curious  result  about  strong  shocks 
that  seem  to  echo  a  theme  in  kinetic  theory.  The  Rankine-Hugoniot 
relations  (2.9)  for  the  conservation  laws  of  momentum  and  energy,  see 
(6.1),   are 

s[m]    =    [um+p], 
(6.17) 

s[E]   =    Cu(E+p)]. 

Assume  that  a  strong  shock  is  impinring  on  a  gas  at  rest;  we  denote 
the  quantities  behind  the  shock  by  capital  letters:  E,  M,  P,  in  the 
front  by  lower  case  letters:  e,  ra,  p;  note  that  u  =  m  =0.  We 
rewrite  (6.17)  as 

sM  =  UM+P-p 

s(E-e)  =  UE+rP 

We  solve  the  first   equation  for  s  and  set   it  into  the  second: 

P-P. 

U+ (E-e)  =  UE+L'P. 

.        M    J 

from  which 

P-p 

(E-e)  =   U(P+e) 

M 

M 
Multiplying   by  —  we  get 
P 

,6.1S,     p    -  1]     (,   -  Ij    E  =  ML-    [l   .  1) 

For  a  strong  shock,   p  P,   e'E  and  e/'P  are  small:  so  we  get 
(6.18)    E  =  MU  (l  +  €). 


€    small.      Thus,    the   total    energy    E   behind   the   shock   is   approximately 

twice  the  kinetic  energy  — MU.     In  words: 
2 

Theorem  6.4:  When  a  strong  shock  impinges  on  a  gas  at  rest,  the 
energy  imparted  to  the  gas  is  equipartitioned;  that  is,  approximately 
half  of  it  goes  into  kinetic,  the  other  half  into  internal  energy. 

Note  that  it  follows  from  (6.18),  (6.18)'  that  the  internal  energy 
is  always  a  little  greater  than  the  kinetic. 

One  would  like  to  know  what  this  kind  of  equipartition  of 
energy  has  to  do  with  the  equipartitioning  that  is  the  hallmark  of 
thermodjTiamic  equilibrium. 
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